(57) Abstract

Microsatellite markers are provided which are useful in identifying linked markers for canine genetic diseases and traits. The 
microsatellite markers are derived from regions of genomic DNA which contain a repeat motif, flanked by unique sequences. The number 
of units contained within the repeat motif is variable, such that various different alleles are present in any given population. The microsatellite 
markers and their progeny are especially useful in detecting genetic diseases not phenotypically visible and identifying carriers of recessive 
diseases, as illustrated in the figure. In a preferred embodiment, microsatellite markers are provided which may be used to detect the canine
copper toxicosis gene.
MICROSATELLITE MARKERS FOR IDENTIFYING
CANINE GENETIC DISEASES OR TRAITS 

FIELD OF THE INVENTION 

This invention relates generally to genetic markers and methods of making
and using such markers, and more particularly, to a microsatellite marker that may
be used to detect copper toxicosis in canines.

BACKGROUND OF THE INVENTION 
Due to inbreeding and the relatively shallow gene pool, a large number of
genetic diseases are present in dogs (Clark, R.D. et al., Medical and Genetic
Aspects of Purebred Dogs (Forum Publications, Fairway, KS) (1994) and Robinson,
R., Canine Pract. 16:29-34 (1991)). Some of these genetic diseases, such as copper
toxicosis in the Bedlington terrier breed, are so prevalent in a particular breed that
the mutant allele frequency may be higher than that of the normal allele (Heritage,
M.E. et al., J. Small Anim. 28:1141-1151 (1987); and Yuzbasiyan-Gurkan, V. et al.,
Genomics 15:86-90 (1993)). Other genetic diseases cross many breeds, as
exemplified by progressive retinal atrophy causing blindness (Barnett, K.C., Adv. Vet.
Sci. Comp. Med. 20:9-67 (1976)) and hip dysplasia resulting in painful and crippling
arthritis (Corley, E.A., Small Anim. Pract. 22:570-593 (1992)).

Canine copper toxicosis (CT) is an autosomal recessive genetic disorder of
copper accumulation which results in severe liver damage. Unless specific
anti-copper treatment is instituted, affected dogs die by three to seven years of age.
While reported in several breeds, it is best characterized in Bedlington terriers, with
the frequency of the defective gene being estimated at 50%. The disease is also
prevalent in the West Highland White Terrier and Keeshond.
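
As a rough illustration of what a 50% defective-allele frequency implies, the sketch below applies Hardy-Weinberg proportions. Random mating is an assumption made here purely for illustration (an inbred breed may deviate from it), and the function name is hypothetical.

```python
def hardy_weinberg(defective_allele_freq):
    """Genotype proportions for a two-allele recessive locus.

    Assumes Hardy-Weinberg equilibrium, which is an illustrative
    simplification and is not asserted by the specification.
    """
    q = defective_allele_freq
    p = 1.0 - q
    return {
        "homozygous_normal": p * p,
        "carrier": 2 * p * q,
        "affected": q * q,
    }


# Defective-allele frequency of 50%, as estimated for Bedlington terriers.
genotypes = hardy_weinberg(0.5)
for name, share in genotypes.items():
    print(f"{name}: {share:.0%}")
```

Under these assumptions about one dog in four would be affected and about half would be phenotypically normal carriers, which is why a liver copper assay alone leaves most of the breeding problem unresolved.
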
Currently, the only method for diagnosing affected CT dogs is by a
quantitative liver copper assay in a liver biopsy sample, after one year of age.
Unfortunately, heterozygous and homozygous normal animals are indistinguishable
from each other by this test. In order to determine if a dog is a heterozygous carrier,
test-breeding strategies must be employed which require that there be a dog of a
known genotype to breed against the potential carrier. This process is very costly
and results in the birth of many affected individuals. It is therefore impractical for
breeders to identify breeding stock free of the gene, and currently carriers of the
gene are only identified after they are found to be the parents of an affected dog.
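
The cost of test-breeding can be made concrete with a short calculation. The sketch below is illustrative only: it assumes simple Mendelian inheritance, that every pup's phenotype is eventually determined, and that the tester dog is either affected or a known carrier; the function name and confidence levels are hypothetical.

```python
import math


def pups_needed(confidence, p_unaffected_if_carrier=0.5):
    """Smallest number of unaffected pups needed to rule out carrier
    status at the given confidence level.

    p_unaffected_if_carrier is the chance that a single pup is unaffected
    when the tested dog really is a carrier: 0.5 if the tester is affected,
    0.75 if the tester is itself a carrier.
    """
    return math.ceil(math.log(1.0 - confidence) / math.log(p_unaffected_if_carrier))


print(pups_needed(0.95))        # tester is an affected dog
print(pups_needed(0.95, 0.75))  # tester is a known carrier
```

If the tested dog is in fact a carrier, half (or a quarter) of the pups produced along the way are expected to be affected, which is the drawback noted above.
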
Because, like CT, many of the canine genetic diseases are recessive, various
methods have been investigated which would identify, on a molecular level,
phenotypically normal carriers. One method that has been employed is the whole
gene subtraction method. This approach requires the sorting out of differences
between DNA from those with or without the disease or trait with molecular
manipulation methods. Unfortunately, this technique is somewhat impractical and
requires that all variability within individuals with the trait, as well as the variability
within those without the trait that is independent of the trait, be differentiated from
the one or few differences that are dependent on the trait. Furthermore, this method
has only been demonstrated on very simple organisms such as yeast, and while this
technique appears theoretically possible for higher species, it rapidly becomes
impractical as it requires many breeding studies of large numbers of affected animals.
An alternative method, the use of restriction fragment length polymorphisms
(RFLP), is extremely labor intensive and expensive with respect to both
characterization and analysis. Furthermore, this technique requires large quantities
of DNA, generally is limited to only two alleles, and only a few loci have thus far
been characterized for the canine genome. It appears that with this method, a
separate genetic system must be generated for each breed of dog, and such a
library may not be sufficiently variable in most situations of interest.

The randomly amplified DNA fragment length polymorphism (RAPD)
approach uses random primers to amplify fragments of genomic DNA that vary from
individual to individual within a species. While the primers are relatively easy to
generate, the method is unreliable, with minor experimental changes resulting in the
resolution of different DNA band patterns. Furthermore, only a few such bands have
been characterized for the canine genome.

The candidate gene method is another alternative wherein one or more
candidate genes are identified based on what is known about the biochemical and
clinical or other phenotypic attributes of the disease or trait and information about
similar conditions in another species where a gene has been identified for a similar
trait. This approach was taken in evaluating genes linked to the Wilson's disease
gene in humans, a disease similar to CT. Unfortunately, the genes linked to
Wilson's disease in humans were not linked to CT in dogs (Yuzbasiyan-Gurkan, V.
et al., Genomics 15:86-90 (1993)). Thus, even under the best-case scenario, the
candidate gene method is merely a guess, and the approach is, of course, further
limited by the availability of identified genes.

Because canine pedigrees for various genetic diseases are abundant, with
several generations and two or more affected members present in many cases,
these pedigrees lend themselves to linkage studies, provided polymorphic markers
are available. Since most of the breeding is controlled, identification of linked
markers would allow concerned breeders to greatly reduce the incidence of these
diseases in future generations.

One type of marker that has been developed consists of simple sequence
length polymorphisms (SSLPs). SSLPs arise from a varying number of repeats of
a simple sequence, such as a dinucleotide repeat at a given locus, and have been
reported to be frequent in most eukaryotic genomes (Tautz, D. et al., Nucleic Acids
Res. 12:4127-4138 (1984)). Such loci, also referred to as microsatellites (Tautz, D.,
EXS: DNA Fingerprinting: State of the Science 1:21-28 (1993)), are best exemplified
by those containing the (CA)n motif. They are found to be highly polymorphic in many
species and are being successfully used in the construction of genetic maps of the
human (Weissenbach, J. et al., Nature 359:794-801 (1992)), mouse (Dietrich, W. et
al., Genetics 131:423-477 (1992)), rat (Serikawa, T. et al., Genetics 131:701-721
(1992)) and bovine (Barendse, W. et al., Nat. Genet. 6:227-235 (1994)) genomes.
High polymorphic information content and amenability to analysis by polymerase
chain reaction (PCR), and thus to possible automation, make microsatellites excellent
linkage and mapping tools.
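
The informativeness of a marker can be quantified from its allele frequencies. The sketch below uses the standard textbook definitions of expected heterozygosity and polymorphic information content (PIC); the specification does not state which formula was used, and the allele frequencies shown are hypothetical.

```python
from itertools import combinations


def expected_heterozygosity(freqs):
    """H = 1 - sum(p_i^2) over the allele frequencies at one locus."""
    return 1.0 - sum(p * p for p in freqs)


def polymorphic_information_content(freqs):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    correction = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - correction


# Hypothetical allele frequencies, not data from this specification.
two_allele_marker = [0.5, 0.5]   # e.g. a typical RFLP
five_allele_marker = [0.2] * 5   # e.g. a multi-allelic microsatellite

for name, freqs in [("two alleles", two_allele_marker),
                    ("five alleles", five_allele_marker)]:
    print(name,
          round(expected_heterozygosity(freqs), 3),
          round(polymorphic_information_content(freqs), 3))
```

A marker with several alleles of similar frequency scores markedly higher on both measures than a two-allele marker, which is the property that makes microsatellites attractive for linkage work in inbred populations.
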

CA microsatellites from the canine genome have been identified and their
polymorphism evaluated on sets of unrelated dogs (Holmes, N.G. et al., Anim.
Genet. 24:289-292 (1992)) or mixed bred dogs and beagles (Ostrander, E.A. et al.,
Genomics 16:207-213 (1993)). Presently there are about 150 SSLP-type markers
available for the canine genome. Unfortunately, these known markers lack the
ability to detect a linked marker for any genetic trait, because of the low probability
of finding a linked marker sufficiently close to a given genetic locus to ensure
detection. Many purebred dog populations have a relatively high level of inbreeding,
which makes it important that such markers be very polymorphic. Further, important
genetic diseases occur across many dozens of breeds, requiring that the markers be
polymorphic in most, if not all, breeds, with many different breeds having varying sets
of genetic problems.

It would thus be desirable to provide a method for identifying genetic diseases
and traits in canines. It would also be desirable to provide a method for identifying
genetic diseases and traits in canines which has high variability and low breed
specificity. It would further be desirable to provide a method which allows breeders
to select and breed for certain favorable characteristics, or conversely, to avoid
unfavorable diseases and traits. It would further be desirable to provide a method
which allows the detection and screening [...] It would further be desirable to
provide [...] disease or trait and affected individuals [...] test-breeding
experiments. It would also be desirable [...] It would further be desirable [...]
marker for the CT gene in canines.

SUMMARY OF THE INVENTION 

A set of microsatellite markers are provided [...] nineteen microsatellite DNA
markers are provided [...] unique flanking sequences may be used as polymerase
chain reaction primers, which allows for the rapid amplification and characterization
of [...] from a small amount of DNA. Thus, each microsatellite marker has a [...]
primers. The microsatellite markers and their progeny are especially useful in
detecting genetic diseases not phenotypically visible and identifying carriers of
recessive diseases. In a preferred embodiment, microsatellite markers are provided
which may be used to detect the canine copper toxicosis gene.

In addition to identifying canine genetic diseases such as copper toxicosis,
the microsatellite markers may also be used to create a genetic map of the canine
genome, generate specific breed profiles, settle parentage disputes and identify dogs
by DNA fingerprinting. Pedigrees of affected individuals, their siblings, parents and
progeny can also be created. Breeders and owners can thus choose breeding stock,
thereby reducing and possibly eliminating the incidence of specific genetic
diseases.

Additional objects, advantages, and features of the present invention will
become apparent from the following description and claims taken in conjunction with
the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS

The various advantages of the present invention will become apparent to one
skilled in the art by reading the following specification and by referencing the
following drawings in which:

Figure 1A is a bar graph showing the average and standard deviation of
heterozygosity percentages across loci within a breed;

Figure 1B is a bar graph showing the average and standard deviation of
heterozygosity percentages across breeds within a locus;

Figures 2A-2D are photographs of gels showing marker locus D02011 in
various breeds; and

Figure 3 is a photograph of a gel showing segregation of alleles at the
C04107 locus in a Bedlington terrier pedigree.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Five hundred and nineteen microsatellite markers from specific gene loci are
provided which are highly variable within and across many breeds of dogs. The
microsatellite markers of the present invention comprise a repeat motif, e.g., (CA)n,
found in the canine genomic DNA, flanked by unique sequences. The unique
sequences (also referred to herein as primer pairs) may be used as PCR primers,
allowing the rapid amplification and thus detection of the sequence of interest in a
small DNA sample. Table 2A sets forth the microsatellite markers of the present
invention. The microsatellite markers and their progeny are especially useful in
detecting genetic diseases not phenotypically visible and identifying carriers of
recessive diseases.

In a preferred embodiment, microsatellite markers are provided which may 
be used to detect a carrier of the canine copper toxicosis gene. As further set forth 
in Specific Example II below, marker locus C04107 may be used to predict the 
inheritance of alleles at the copper toxicosis locus. C04107 has also been used to 
locate two other marker loci C04107B and C04107C, which either singly, or as a 
group, may also be used to detect the copper toxicosis gene. 
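
How a linked marker predicts inheritance at the disease locus can be sketched in a few lines. The example below is a simplification, not the procedure of Specific Example II: it assumes that the marker allele travelling with the CT allele has already been established within the pedigree and that no recombination occurs between marker and gene. The allele labels and function name are hypothetical.

```python
def infer_ct_status(marker_genotype, ct_linked_allele):
    """Predict CT status from a dog's two alleles at a linked marker.

    marker_genotype  -- pair of marker alleles, e.g. (1, 2)
    ct_linked_allele -- the marker allele that segregates with the CT
                        allele in this particular pedigree
    Ignores recombination, so the result is a prediction, not a diagnosis.
    """
    if len(marker_genotype) != 2:
        raise ValueError("a diploid genotype has exactly two alleles")
    copies = sum(1 for allele in marker_genotype if allele == ct_linked_allele)
    return ("homozygous normal", "carrier", "affected")[copies]


# Hypothetical pedigree in which marker allele 2 travels with the CT allele.
for genotype in [(1, 1), (1, 2), (2, 2)]:
    print(genotype, "->", infer_ct_status(genotype, ct_linked_allele=2))
```

Because the phase between marker allele and disease allele can differ from family to family, the `ct_linked_allele` argument would have to be determined separately for each pedigree.
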

The method of the present invention is useful for identifying disease-free
individuals (homozygous normal), carriers (heterozygous) and affected individuals
(homozygous affected) at any stage of development. While a single marker may fail
to provide the required information in any particular pedigree, a series of progeny
markers will, and thus such a family of [...]

SPECIFIC EXAMPLE I 
Materials and Methods 

Isolation and Characterization of Microsatellite Loci. [...] were used for the
cloning and screening procedure as [...] (Sambrook, J. et al., Molecular Cloning: A
Laboratory Manual (Cold Spring Harbor: Cold Spring Harbor Laboratory Press)
(1992)). [...] peripheral blood sample [...]. Size-selected fragments purified from
[...] Corp., Chatsworth, CA, were cloned into the phagemid vector [...] prepared,
denatured and hybridized with (CA)[...] oligomer, labeled with 32P dCTP [...].
Positive colonies were picked with a sterile pipette tip and [...] of a solution
consisting of 1% Triton X-100, 20 mM [...] primers complementary to the T3 and T7
promoter sequences which [...] chain reactions for 30 cycles of 94, 55 and 72°C at
1, 2 and 3 min., respectively, after an initial denaturation at 94°C for 4 min. The
standard buffer, nucleotide and primer concentrations were 50 mM Tris-HCl (pH 8.3
at 25°C), 50 mM KCl, 1.5 mM MgCl2, 200 µM dNTPs and 40 pmoles of each primer in
100 µl reactions. PCR reactions were carried out on either a Perkin-Elmer Cetus
(Perkin Elmer Corp., Norwalk, CT) or an MJR PTC-100 thermocycler (MJ Research,
Watertown, MA). To carry out secondary screenings of the clones, aliquots of the
amplification products were run on 1.5% agarose TBE gels (90 mM Tris, pH 8.3,
90 mM boric acid, 2 mM EDTA). Southern blot analysis was carried out on the gels
after transfer to Gene-Screen Plus membranes (NEN, Boston, MA) using the alkaline
transfer protocol. The membranes were probed with (CA)[...] oligomers, 3'
end-labeled with digoxigenin-dUTP using terminal transferase. A chemiluminescence
detection system based on Lumi-Phos 530 as a substrate was used to detect positive
hybridization signals following the recommendations included in a commercial kit,
Genius (Boehringer Mannheim Corp., Indianapolis, IN). The membranes were
washed to a final stringency of 0.1 X SSC (1 X SSC = 150 mM sodium chloride,
15 mM sodium citrate) at 65°C. The blots were then processed for immunological
detection as described by the manufacturer. Once a clone was confirmed to be 
positive, the corresponding amplification product was then purified using QIAEX 
beads (Qiagen Corp., Chatsworth, CA) after electrophoresis on TAE gels (40 mM 
Tris acetate, pH 8.3, 2 mM EDTA) and directly sequenced using cycle sequencing
(Delta Taq 2.0 Cycle Sequencing Kit, United States Biochemical Corp., Cleveland,
OH). The sequencing reactions were carried out according to the manufacturer's
instructions with the slight modification that T3 and T7 primers labeled at their 5' end
with 32P ATP (NEN, Boston, MA) using T4 polynucleotide kinase were used as
sequencing primers. Sequencing products were analyzed by electrophoresis on 6%
polyacrylamide gels containing 8M urea. The gels were dried and exposed to
X-OMAT X-ray film (Eastman Kodak, Rochester, NY) for 1-2 days and developed.
Primers flanking the repeat motif in each insert were selected to minimize hetero-
and homodimerization; occasionally, the computer program Oligo (National
Biosciences, Plymouth, MN) was used to help in the primer selection. The primers
were synthesized by the Michigan State University Macromolecular Structure Facility. 
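
A crude way to screen a primer pair for hetero- and homodimerization is to look for the longest stretch over which two primers could base-pair in antiparallel orientation. The sketch below is only that: it is not the algorithm used by the Oligo program, it ignores thermodynamics and 3'-end effects, and the function names are hypothetical. The two sequences are those listed for locus C04107B in Table 2A.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_complementary_run(primer_a, primer_b):
    """Length of the longest contiguous antiparallel base-paired stretch.

    A stretch of primer_a pairs with a stretch of primer_b exactly when it
    equals a substring of primer_b's reverse complement, so this reduces to
    a longest-common-substring search (dynamic programming, row by row).
    """
    a = primer_a.upper()
    b = reverse_complement(primer_b)
    best = 0
    previous = [0] * (len(b) + 1)
    for base_a in a:
        current = [0] * (len(b) + 1)
        for j, base_b in enumerate(b, start=1):
            if base_a == base_b:
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


first_primer = "ATCGAGTCCCACATCCTTG"
second_primer = "CATTTACTGGTTTGTCAGTTAGG"

print("heterodimer run:", longest_complementary_run(first_primer, second_primer))
print("homodimer run (first):", longest_complementary_run(first_primer, first_primer))
print("homodimer run (second):", longest_complementary_run(second_primer, second_primer))
```

Shorter runs suggest less opportunity for primer-dimer formation; any acceptance threshold would be a design choice that the specification does not state.
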

Dog DNA Panel. To check the usefulness of microsatellite markers within
and across different breeds of dogs, a dog DNA panel was established. The breeds
to be included in the panel were chosen with consideration given to the diversity in
origin and function of breeds that exist. Table I presents various characteristics of
the breeds chosen for the dog panel (Alderton, D., The Eyewitness Handbook of
Dogs (New York: Dorling Kindersley) (1993); American Kennel Club, The Complete
Dog Book (17th ed., New York: Howell Book House) (1985); Clark, R.D., Medical and
Genetic Aspects of Purebred Dogs (Forum Publications, Fairway, KS) (1994);
Walkowitz et al., Successful Dog Breeding (2nd ed., New York: Howell Book
House) (1994); and Lee, M.P., The Official Book of the Scottish Terrier (Neptune
City: T.F.H. Publications, p. 158) (1994)). Five to ten individual dogs from each
breed were selected for inclusion in the panel. Pedigrees were investigated to
ensure that only dogs that had no common ancestors through four generations were
included, for independent representation of alleles. Ten apparently unrelated mixed
bred dogs were also sampled. DNA was isolated from peripheral blood as
previously described (Sambrook, J. et al., Molecular Cloning: A Laboratory Manual
(2nd ed., Cold Spring Harbor: Cold Spring Harbor Laboratory Press) (1989)).
Analysis of Microsatellite Variability. Amplification of the correct target was
verified by comparing the product obtained from genomic DNA to that obtained from
the reference clone. The variability at each locus was tested by amplification of DNA
from the dog panel. PCR conditions were 35 cycles of 94°C, optimal annealing
temperature (50-60°C) and 72°C at 1, 1, and 2 min., respectively, after an initial
denaturation at 94°C for 4 min. in the standard PCR buffer conditions described
above. 100 ng of genomic DNA was used as template in each reaction. 10 µl of the
PCR products were analyzed by vertical electrophoresis using a modification of an
SDS-PAGE (sodium dodecyl sulfate-polyacrylamide gel electrophoresis) protocol
(Laemmli, U.K., Nature 227:680-685 (1970)) as described previously (Tas, S., Anal.
Biochem. 188:33-37 (1992)). An HSI SE600 vertical slab gel electrophoresis system
(Hoeffer Scientific Instruments, San Francisco, CA) connected to a cooling unit was
used. The gels were poured between 16x16 cm plates using 1 mm spacers. 1.5%
acrylamide stacking gels of 2-3 cm were used on top of 12.5% acrylamide
separating gels with a 30:0.8 acrylamide to bis-acrylamide ratio. The gels were run
at 40 mA through the stacking gel and then at 70 mA through the separating gel
until the bromophenol blue dye reached the end of the plates, for approximately 4
hours. The amplification products were visualized after silver staining with the Silver
Staining Kit (Bio-Rad Laboratories, Richmond, CA). This procedure resolved
differences greater than or equal to 4 bps in the size of amplification products in the
75-250 bp range.
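
The stated 4 bp resolution limit has a practical consequence for dinucleotide repeats, whose alleles differ in steps of 2 bp: alleles one repeat unit apart may not be separable on this gel system. The sketch below groups product sizes that would fall within the same band; the allele sizes and function name are hypothetical and serve only to illustrate the limit.

```python
def resolvable_bands(allele_sizes_bp, resolution_bp=4):
    """Group product sizes that the gel cannot tell apart.

    Sizes are sorted, and a size joins the current band when it lies
    closer than resolution_bp to the previous size in that band.
    """
    bands = []
    for size in sorted(allele_sizes_bp):
        if bands and size - bands[-1][-1] < resolution_bp:
            bands[-1].append(size)
        else:
            bands.append([size])
    return bands


# Hypothetical alleles of a (CA)n marker, in base pairs.
print(resolvable_bands([160, 150, 152, 156]))
```

In this example the 150 bp and 152 bp alleles would be scored as one band, so heterozygosity estimated on such gels is, if anything, conservative.
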

Results 

Screening 110 plates resulted in the isolation of 1064 independent clones that
were confirmed to be positive on secondary screening. Using 600 bps as the
average insert size and 500 as the average colony number per plate, it was
calculated that 1064 positives reflected an estimated incidence of one CA repeat
clone every 31 kilobases in the dog genome.
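
The incidence estimate can be reproduced with a few lines of arithmetic. The sketch below assumes, as the calculation above implies, that every colony carries one insert of average size; the variable names are illustrative only.

```python
# Figures taken from the screening results above.
plates = 110               # plates screened
colonies_per_plate = 500   # average colony number per plate
insert_size_bp = 600       # average insert size in base pairs
positive_clones = 1064     # clones confirmed positive on secondary screening

# Total genomic DNA sampled, assuming one insert per colony.
screened_bp = plates * colonies_per_plate * insert_size_bp

# Average spacing between CA-repeat clones, in kilobases.
kb_per_repeat = screened_bp / positive_clones / 1000

print(f"{screened_bp:,} bp screened; one CA repeat every {kb_per_repeat:.0f} kb")
```
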

The first 14 CA repeat loci for which primers were designed are presented 
in Table 2 together with the optimal annealing temperatures. 
The complete set of microsatellite markers is set forth in Table 2A below.
These markers were identified and the primers designed as described above. 



WO 97/31011 



PCT/US97/02396 




Table 2A 



] Mark* 
J Loan 

IC00103 
| COO 104 

Icooin 


CTACTCTJ GTATTCCATCAAT 
TG AC AT AAGCTGTG AG AAG AC 
TACGGAGCCCACACTACTGA 
AGCTTCCAOGTCTGGTTTTCCAAO 


Alii 1 CCCC ATTCTC ACTGCT 

ai ioaaactoatagagaagag 
tccaaoggaagtcatagaac 

TATCCCAGACXTTTAGAGCCTGOCA 


PCR 
JYoduc 
(bps) 
242 
140 
226 
174 


Motif 

_ (GT)2\ 
TGT>9 

__ (G-nn 


iCOOilj 
| CO01 14 
JC00203 
JC00203 
jc00215 


TTTTTG ATGGCTG AAT AAT A 

CTGCTTCTCCCTCTGCCTATGT 
AGGGTGCCTAACTGACTOAGCC 
AGGGTGCCT AACTG ACTG AGCC 
TGCCCC1 I AAAG ATTTT ATTT 


GAATGQATAAAGAAGATGTO 

ctaccacagccaatgttgattga 

1 1 icaaaatgggctttccttt — 

cctgcatcgaacctgcttct 


82 
"740 
162 
162 
127 


iGT)I I 

(ui )I4 

I (GT)12 

__ (AC) 17 

(AC)17 

(CA)I0ACT(AC)12 


IC0Q2i7 

IC00304 
|C00403 


tcctgcatggagcctgcttct 
gcaccacttgtaacccttgaac 

ATGGAGCCTACTTCTCCCTC 


TGTGTArrCACiA7GTGCTACTTGG~ 
TCGCATAGGATGATGAATAATA 


181 
181 


(AG)4 

T11A2G(AT4XAT I 
3)2(AT2)__<AC)10 
.-.(GA)16 ! 
CA>4TA(CA)12 


|C00412 
fC0030) 

jcooica 


ATC AGTCC ATTCTG ATTGGCT ATC 

ATCACATCCAAATCAAGACTAT 

TGACTTTACCTTACTTCACCTT 


GAC i i GCTG TaI "1 GGTTaCACT 
GAAAATGGCAGTTGTACCTGAATCT " 
TGTCCTATGCCTGTCCTATTAT 


123 
209 
172 


<TG)I1 
i (TO)13<TA>4 
(AC) 15 1 


tc53553 


AGGGCAAC 1 J GGTTAC AGATTA 


109 


' (CA)3T(AC)2C2(C 1 
A)6 


j CO03O6 
JC00602 
ICOO603 
{COO607 
I CO0613 
|C00802 


CAGAGCCTTCAGATAACAGTA 

catatccatcctcctaaactttc 

CCAGGAAGTTATGATTCTAAATGT 
CTTTTCCTATTGTCACAAATO 
AGTCCCACATCGGGCTCTCT 
GTGG AGCCTGCTTCTCCCTCTG 
TACCT G AGTC AGTTT ACCT AGC A 


An A iTCTTTCCCTTTTCTAC 

AGTGCCTAAAACTAAC AG AACTG 
GAGCTTCCTTCTCCCTCTGCC 
AC AG ATG AATG AATAC AGTTG 
TGCTGGTT1 CTCTCTl GTGTCTTAT 
C i i t:c AAGTGC AAAC AC ATAGC 
GTTTCTACAGTCAACCAGATG 


230 

173 
214 
107 
169 
191 


(GT)9T(TG>4<TA>4 | 
<TG)7 

(GT)2A(GT)9 1 
(AC)7(AG)8 
(TG)i2 J 
(CA)9TA/CA>4 | 
(CT)l(A3T)n I 


JCO0803 
1000901 
JC00902 

jcoiooi 

] CO 1 003 
j C0120J 

JC0I207 
IC01212 


T AAG AGTTATGCCACTTG ACC 
1 TAAAGGTCCATTGATAG AGGA 
GAGCCTGCTTCTCCCTCTG 
ATGGGCTCC AAG AATAGC A 
GAAGTAAATCAACAAACAATCA 
ATTCTTTCTATGGCTAGGCAGT 

AGACCACTCTGCTCCCTCTT 
AGGTGTTCTCACTCCTCATA 


CCAGGGAAGAGACCAGTATATGA 

tgatcccagoagttcattctt 
tgj 1 1cj icaatgacctttcag 
accagaaacttcattgtctcc 
gaagcaaaagtataagagctgtg 

TG AGTTTCTCCCTCTTTCTCT 

TGCC T I G AAATG AACAATGA 
CTCCCTCTGCCTGTGTCTCT 


185 

100 

105 

173 

219 

87 

150 

84 

115 


(GT)19 J 

(GT)12 1 
(AQ12 I 
(CA)I4 

(GA)12 | 
(AC)ll J 
(GT)6A(TG)5A(TG J 
13 I 
(GT)15 I 
(CA)10 I 


IC01304 
JC01305 
|C0J4Q3 

1C01406 
IC01406 
ICO 1-409 
{ CO 1505 
4cO160I 
J CO 1606 


CTGAGCAAGACCCATACCACTT 

GCATGAGATAAGACACCACCTGTT 

GAGGCTGACAACTGTTTGCTA 

GAl l i IATTCATTTATCCATGAC 

TGGTGAAAGTAACTAAGAACA 

GTTCTTCCCCAATGGTATTTA 

TCTGCCTATGTCTCTGCCTCT 

CCTGCATGGAGCCTGTTTCTC 


CCTCCC C AG AAC AATCT ATTTC 
l"l CATTTCCTGCCTCCTGTG 

ggagataaatgatgagaactca 

ctccctctgcctatgtctctg 

ttgcataaoagccagcaaact 
ataagatacacoaaccattagcc 
CAi i ictggaagacatactggta 


180 
136 
284 

107 
150 
246 
109 
145 


(TG )7T AfTG )4 J 
(GT)9 1 
(AT)2T(AT)7CA(G | 
A)4_(CA)7(GA)2( 
CA)2 j 

(CA)I6(GA)16 
(CA)16(OA>17 j 
(CA)6A2(CAtt j 
(GT)13 1 
(GT>7 J 


1C0I801 
]C0I802 
JC01908 
IC02005 " 
IC02122 

C02401 1 
C02509 " 


ATGCTG7TGATTACACAGACC 
TCTGATTTl CACCCTT AGAAC 
ATGCAAGTTCTAAAACCATACTG 
ATCAAGTCCCACATCAGCAGCCT 
GAGTAAAGAAAGAGTT JG AACAAT 
'Vi ui wAUUV i LLL 1 LfCATLKj 

ZCAGACCCAATGACATCTCC " 
rGGCCTAAACACCTCTGACAT ~ " 


atcacttcctggtattcacac 
gca gttttcctgtctctctt 

TAGTG AAG ACAGGaTTGTCTTG 

agtggtatgaggggcataaggaa 
agttggagaaatgagcactta 
gttaaatgtaagatgtc" cagccttt 

acccaggtgccctcttatcc : 


109 

144 

137 
189 
146 
149 

136 ( 


(GT)19 | 
<TG)10(GT)9 j 
TG)19 J 
(CT)\0 J 
fGT>10 1 
(CT>4GT<CT)6(GT 1 

OT)I8 


C02511 < 
C02601 < 

C026O4 1 


3ACATGATTACCACATTCATC < 
7TCCCTCTGCCTGTGTCTCT ~" " 

rrCACCCAGAGGATGCTTTGAA " 


rGGGATACAAAGTAAATOOAAC 

3T AC AACTG AAG AG ACTG ACC 5 

rGTTAGTCTTAGCCATTCTGA 1 

IT AACCTG AG AAC ATGGC AC AA J 


189 ( 
>7 < 
144 < 

90 ( 


CA)18 1 
GT)16 I 

:GT)8(Cr03_(CA) 

S 

CA)17 | 


C02608 p 
C02705 1 


KGGGAGCAGGTTTGTGGTTG 
jAGTGATTCTCATTCAAAAAGGGA 1 


"ACTTCTGGTCCAAC A 1 "1 TCC 
rCAAGGGCACTTTCTACTGTGTA 


10 ^ 
16 < 


GT)19 1 
GT)I0 


C02709 C 
C027I1 1 


rrCTGCCTACGTCTCTGCC C 

•ctcattcaaaaagggagatgc 1 


:accagtatgctgatataattct 
ttcaagggcaci 1 1ctactg 


42 ( 

09 ( 


CA)18 f 
GT>10 j 
Table 2A (cont.)



Marker Locus | Primer 1 | Primer 2 | PCR Product (bps) | Motif
C02712 | GCTTGGATGCTATTGGCTCAA | CAATGACTTGGGAAACTACATTC | 156 | (GT)22
032*02 | CCCTGCATAGAGCCTGCTTCT | AUCI : 1 1GCI 1AI IAIA1GC1 1G | 186 | (GT)6(CT)2CA(TG *<T02rAn-n
C02805 | GACAAGAACAGGTATGAGAGC | TGTTGAGTGTAAGATTCAAAGC | 118 | (CA)12
C02J06 | TCCCTCCTCCTGTGTCTCT | CTACACCTCTGAAACTACCA | 139 | (CT)11GAG(A3T4)(CT3)A6(TA2)3(TA3)
C02903 | CCTACATGGAACCTGCTCTTC | TGTCl I 1 CCCTCAACAAGATG | 167 | (TC)4TG(TC)6
C029H | ATCATGGGAGAGGGTGGTAT | GGGTAG ATAAAGACC I G 1 AAG | 122 | (CA)16
C030Q1 | TTCAGAGTTAATGATGCTTAGG | GAGATTCTCTCCCTGTACCAC | 153 | fGTV(GA)I7
C03102 | ACTTGTGTT ACCCC1 J » fACC JCCiGCCI'l iAlGGAUi 1 I ACA | [...] | 108 | fCA15TA<CA)15
003104 | TCCCTCTGCCTGTGTCTCTAC | ATCAATGAAACAAAAGGAACAGTA | 147 | (Cm\9
C03109 | CCTGCATGGAGCCTGCTTCTC | CACACCAATTAAACAATAGACATT | 185 | (GT)16
COJJOJ | CCATTCCCATAGAGAGGAA | ACCTAGCCAGGACTGGAAAG | 118 | (CA)TTA (CAM1
C03302 | TGAGTATTATGACCTGGAGGGT | TCAGTAGGTTGTGTCTAGCCT | 97 | rGT)iicrrG)5
CQ3302 | TCTCAATGATACAAGAACTTCAC | TCCAGTCACCCTCCAAGATGT | 185 | (AT)11(TA)8(CA)16
C03304 | ATTGGCATCATTCCACTGGTCA | TGGAGGCAGCTTAAATCTCAACA | 95 | TAC)16
C03308 | t TGATAAGAGTGTGAACAGAGAAG A | CTAGGAGATTGTACAGGTGCT | 275 | (GA)20
C03401 | GGTCATCTTl ATACCATCAATTAG | Clll AATGCT GGC AG ATGCT AT | 104 | (CA)10
C03404 | CAAllClClClAlCXJCICM IGI | TCTTCTTGATTCACAGCCAATCT | 171 | (CT)4T(CT)2GT(CT)10CCA)18
C03301 | ICGOAGAIOOAAACJ 1 J 1 G » AAGAG | TCTAGTGGACTGTTCTGAATTTG | 106 | (GT)21
C0O507 | ATCTCGTAATTTCCCATAATACTTA | ATCAAGTCCCACATCAGACTCC | 161 | (GA)2(CA)5TA<CA y6(GA}6
C03508 | TACTCCAATGGCAACAGTTTA | i I auaCCa I CT ALt. IC I I I'lC | 110 | (CA)3G(CA)17
C03509 | CATTCTGCTCATCTCCATAAG | GGC ACAACT AACTC A 1 I 1C1 AT | 188 | (CA)13
C03510 | CCTGCATGGAGCCTGCTTCTC | TGGCTATTTATGGAGCATCTCTT | I | (GT)19
CQ3512 | GAGCCTGCTTCTCCCTCTG | GAGACCATAATTCACAATTCTTC | 113 | (TC)12ATGA2T(A ^\T-» A-
CO360] | AGCCTGCTTCTCCCTCTGTC | TOTTGCTT ACCCTTCTGTT AG A | 151 | (CTp(CT) 1 0{CT)2
C03607 | AGTTCCATCCACATCGTTGCA | AGAAAGAGCCTAGATGCCCAT | 14] | (GT>U
C03S10 | TGCTTCTCCCTCTGCCTGT | GOCTG T AA G ACGC AG ATTTCT | 134 | (AC)17
C03814 | ACATTGGGTTCCTGCATGGAG | GGCAGTTTGGTGATGTCTATCAA | 237 | (TG)19
C03815 | GTGCATGGAGCCTGCTTCT | AGCTTAGCACCCTGCATGGA | 161 | (CQ6 <TA3)2(T A5VT2A4VTA3>4
i_ujyu / | TAGTGCTCATGGAGCCTTTCA | T A T/VTfi A TTrr A fT'T A r~V** I' f*» J A 1 I OA i J LLALL i ACv_ I U | 83 | (GT)13
C03909 | TCAAATCAACTCGTGTTTCTGT | GGATCTGATAATCCACTTTAGA | 71 | (TG)8
C03913 | GAAGGGACAGAGAAAGAAATGAC | TGTAAGGGCTGTTACCTCTAATC | 333 | (TC)13(AC)12
C04003 | GGGTCTCCTTATCACACTG | AGCAACACTTGACATTATTT | 135 | (CA)12
C04007 | ACCAAATGAGCCACTTAGGT | CCTCTGCCCTTTCCTCTATG | 109 | (CA)M i
C04103 | AATGCTGTGGAAGGTGAATGATA | ATGGAGCCTGCTTCTCCCTCTG | 224 | (CA)8(GA)4
C04107 | TCAGCAACTATACATTTAAGAGCA | CTGTCCCATCTAAAGGATAGO | 160 | ((GT)6GA/GT)1 1
C04107B | ATCGAGTCCCACATCCTTG | CATTTACTGGTTTGTCAGTTAGG | 120 | (AG)11
C04107C | TGGGAGATGAAAAGTATCCTC | CCTGTGCCTCAAGATAGATO | 250 | (CA)18
C04201 | GAGTTCCTTCTTCCGCATCTAG | ACTATTCAGAAAGCAGTACAACCT | 120 | (GT>6A2fCmi4
C04208 | ATCCTAGTTAGGCATGTGCTT | GGTAAATTACAGCAGGTOAT | 205 | rGA)2(AC)l 1
C04302 | TGGTTATTACTGAGCAGACATC | GL J I 1 ] G IT] CCTi CAAATAC | 168 | (GT)21
C04601A | AGAACCTATCCAGCTATTATAGTG | CTCTCAGATATGACCAACCTA | 2U | rrc)i*
C04601B | ATATACTTTCACTCTCCATGCAA | AGAAGAGGAGTCTTTGGATG | 139 | (TG)18
C04704 | CAGTTGCTAAGAGGTAGGTC | GTAAATGATTACCATAATAAGGT | 114 | (CA)13
CM7J6 | TTCTCCCTCTGCCTATGTCT | AGCACCCTGGTACTGTTTCT | 133 | (CT)3(GT)9(ACTX ATC>A<TA3)2CTA8 VT3A12)
C04802 | TTACCAAGCTAAGCCTGGCA | TGGAACCATCACTGAAGGGA | 150 | ( C6 A YC6TX AC U 0
CM 802 | AGACCACCOAATGGATGGAGT | TGGAGTAAGTAGCAATCCTCT | 144 | (AC)17
C04805 | CTTTGGTCTCTGGTGGCAATAG | TGGACTTGTGATACACCGCACT | 207 | (CA)17
C04806 | GCCTCACTCATCATCATTTTC | GAACAAGAGATTCATATTTGCTATCA | 180 | rro>i8
C04903 | ACTGCAAATAACCTGTAGAGTGCT | ACCAATCACCATTCCCTCATTC | 157 | (AC)16
C04904 | AAGACTTCACCACTCACAGTCA | CTGGCTCAGTGTGTATGAATG | 143 | ffCA>5T(CAMl
C0510J | rTCTTAACCGACCTTGACACC | AGAACTTGCTTATGAAGTCATGT | 20S | (AC)15
C05102 | AAGCTGTGATGTGGCTCTCAAC | CAATGGGCAGAAACAATGAGGA | 171 | fAC120
C0MQ3 | ATTGGCATTTATCTTCATTGT | AAGAGGAAAGAATCTGTGAACT | 196 | CGT)16T(GT)2A<T
C03 n 0 | TGGAGCCTGCTTTTCCCTCT | ACCCTGAGACCATGAGCTAAG | 185 | cmsrGTO
Table 2A (cont.)



CO 73 04 


CAG 1 i AC AT A 1 ALLAITAOllA 


1 ULL I CO 1L1 1 1UIL1 LLA 


109 


(CA)7TACG(CA)1 
0 


C07308 


ALA I 1 UUXA.t* 1 AA 111 AA J AO A l 


r.TPrTriT An APrPTTATAnTAn: ap a 


1 J. f 


(CAM 1TA<CA>3 


CO 740 3 


TOCCAJ LI 1 L I OA i OL iLJ JU 


i Lu 1 Oo 1 JL 1 IL1 Ou AA ILlU 


134 


(CA)14<T3A)I0 


LU fAU/ 


TP A TTP A Tr A A nTHI^TC A fiTTAT 
1 LA 1 1 L.A J LAAU 1 ww 1 w/\VJ 1 IAI 


v J J A 1 VAAJV 1 UUAUU I W 1U 1 A 


i *> i 
I J. i 


(CA)15 


CU / 4 I J 


TTp*r a PW* A fWl AAA PTPiTT ATfi 
1 I UALAULAUUUAAAW 1 O 1 i a i u 


Af!CCrATCAATCAAfiAOAA<TTTA 


IaU 


(GT>1 8 


LU 74 13 


A A riTtTHT A PT 1 P/ **l f STTP AT 
Ant lUlUl AU» 1 I wO JU1 1 Wl I 


ATTTA ATPfi A fTfi A ATfTTTTTr 


i n i 


(GTJ8 


CO 75 02 


p a tp a ppptp ap.a rrnTTA rrrfiTT 

LA 1 LALIX 1 LAUAL 1UI 1 AO lull 


OwA J Iwl J Iwl OVI 1 UUVJAUUA 


1 BL> 


(GT)1 1 


CO 79 02 


UAL I OA 1 (AJ 1 UUAUUU 1 O AO 


Iwl O Aww AO 1LIU1 AAAL I Aw 


9 1 


(OT)U 


COS 103 


f*T"l /*m AAA TP.T A ATflTnTm' A 
Li i UU AAA IU1 AA iUiUIUi A 


wrtO I 1 O 1 OA IAI 1 1 Jul 1 1 | LAO 


9 1 


(CA112 




A J O il L 1 1 /\0\^w/\0 1 wrt. 1 AV^\^V 1 V_- 


111 VIAUVl A I VA/U aV 1 W A J w 1 w 1 A 


mi 


(GT)13 


COS 2 0-4 


TP A TPTAf *n PJTTnTnT AfiCC 

1 OA 1 L 1 Aw J 1 WW J vJ 1 V-» I AU\pA> 


orsACATAJVfiAfjn. atgtit; ap. a a 

OOa\w-/V 1 AAUAUUA i O 1 O AOA^V 


1 It 
1 I J 


(CA12 1 


wlVS-^J 1 


a AnrAfiATriTTPAArcArTfiT 

AAUvAvJA t Uv 1 l>AAl.CAl< 1 U 1 


A nCi A TCfi AfTTPPtT A nrTTT* A n 


1 


(CA)13 


C08413 


ACTT AACTAG AG A GCGTGTG ACT 


ACCTAC 1 I GCGTG 1 1 1 J AAGG 


135 


(GT~) 1 3 


CO 8601 


AT AT A CTTTC A CT CTC C ATGC AA 


AGAAG AGG AGTCTTTGG ATG 


139 


(o J )1 8 


C08608 


cacagaatactggaactcatttag 


AG AATCTT A TT GG TTCGG I'l ' 1 GG 


1 35 


(Ul 115 


CO8903 


AACTGACATCaACAGTCTGATAC 


CGAC I C 1 AAGATCG AG ACCTC 


1 86 


(LA)16 


C09004 


CTACATGGAGCCTGCTTCTC 


TG AAG AGG AATGG AATG ACTC 


138 


(0711 1 


C09107 


CCTGC ATGG AGCCTGCTTCTC 


AC AAAT AGGTGGTC ACTT A CTG A A 


150 


(L j )l4KO 1)7 


C09109 


Tfir, a nr* a a np ACTTTCTATAAAC 


GAGCCTfiCn CTPPCTPTr; 
unuv<u 1 wv t ■ w 1 www 1 w ■ v» 


148 


I (O 1 II O (OA)fl 


C09205 


CrrCAjUTAATGTiAAfiTGGCT 
w-w l wrvv^V l j i v_r\_rw ■ 


C AATCC AGTTATO AAATr.TTP A P 


1 23 


rG*o 1 4 


C092 10 


GGTGGCTC AG TGGTTT AGC A 


GGTGGTT ATG ATTG T ACTTTCT G 


149 


, /P AM C 
1 LA Mo 


C09211 


TCACCTACTG AG ATACTTCCAT 


CTGCCTATGTGTCTGCC 1 I C 


204 


( w f\) 1 


C09213


TTTC ACCTCTG ATT ATATCT AGG 


TGGATG<)AAoCCTGC n CI C 


140 


/APM fi 
1 AO HA 


C09215 


CC AGG AATAG AC AATGCCC A 


AACCCTAAGACC 1 1 J GTAATC 


255 


\ OA? 1 A 


C09217 


CTCTGCATAATGCCTGCT 


AAG ACT ATTTATTTATTC AT AG AC 


80 


ptom i 

< IUMI 


1 CO9220 


CCJAwlol J I ICIGIAItGGCA 


CTGC AT AAA GC CTGCTTCTCC 


165 


/P A \AT A ff* a > O 
ItAnl Al LA io 


C09303 


TCTGTCAATGGATaAGTGGAT 


TCCAGGTTTATTCAAGTAGTTAC 


129 


\L A M J 


C09304 


CTAGATTCATCCACGTCACTG 


CCATCAACTGATAGGGAAGAT 


129 


IT5TM 9 

( vJ 1 )\ j | 


C09305 


TTGCCATCACTGATACaAGT 


TTATTTCTCrrTCA7ATAAATAGCT 


181 


/P A VO i 


CO9307 


TTACCCTTGGCTATCTATCTAT 


CTGTTCCATC 1 1 ' J " 1 CCACCTTA 






CO9309 


TGG AGCC AGTTTCTCCCTCTG 


TGTTTC1 1 GATTTGGGTGGTA 


14 I 


(o-T)13 


C09310


tagaggatcaggtcccacgtc 


GCAGTGCCACGAATGAGTCA 


264 


(L i } 1 1 (o 1)17 f 


C09312 


aactgg aaaaatgg at aatcag 


TTGGAAAGATATTCACATTCAT 


1 A^l 


/■p A \Q 
^LAJV 


C09314


gtcactaaattcacgttattga 


Cm I ' I CTC AGTGTGTCT CAG AA 




(LA)BuiLA)6 


CO9403 | 


AGATTTGAACCAGGAaATTAGGAA 


CTTG AG ACTCTCTCTCCTCTGTCC 


182 


1 OA 17 \ 


CO9407 | 


TGTTaATCTTCCTaATCTTCCAG 


TCCACTGTTATTGGCATCACAT 


104 


\LA)J D 


C09413 1 


TGGAGCCTGCTTCTCCCTCTG 1 G ATCCAC ATCCCTG AGCTG A 


202 t 


fGTJ9 


C09601 1 


TGGAGCCTGCTTCTCCCTCT 


TGCTTC AAAGG AC AC ATC AAGGT 


138 1 


/GTi17 1 

\ w III/ 


C09607 | 


GCTGGJ icj n ctctatttatac 


TTCAAAGCTAGTCACTATTAGCA 


13! | 


(CAM 3 


CO9609 | ACTGCTGG7TCT7TCTCTATTT 


CK3TAAATACTTGAGGAATTAACATT 


IV 1 \ 


fCA>12 


CO9610 


CTAGCTTGCTCCACTGAoTTCC 


CAG ATGC CTCCCTaAAGATGTG 


163 ! 


fGTJ9 


CO9703 


CCTTCAGGAATCTAGGGACAA 


TGTATTTCCTATGCaATATACC 


— \ 


fCA>16 


CO9803 


GTGC CTGCTTCTCCCTGTCTC 


cacagcaagtgagagtgagca 


156 


(GT»10 


C09806 


GTAGTCTGCTTCTCCCTCTCC 


ttctcatatgtggtaactgagta 


208 


(CA-116 


CO9807 


CK^CAAATTAACCTATATTTAGAAC 


aaggcctcagacatgaactataat 


176 


fGT)6AT(GT)3 


CO9903 


TCCACATCCTCTTATCTGTTG 


AACTCAGTGGGACCTTCAATA 


148 


fGT)5ATfGT)n 


C09912 


AAGATGATAGCTTGGTCAaAGAG 


G AACC AGGT AATTCTTCT ATTG AA 


135 


rCA^8AA/CA)I0 


CJ0103 


GTTGGGCTCCCTACTCAGTG 


GAGTGTCK3AGACTGCTTAATA 


289 


rcA)ii 


C10104 


GGCAGATTTCTCaATACAGATTA 1 


TGCTCTCATAATAGACGAATCACC 


119 


fCA)12 


D00101 


ACTCTTCTCCATCTCCCTCTGC 


TCGTTGGoGTTAAAGCTCTGACC 


150 


(CA)9 


DQOiGJ 


GTACTTCCTCAGCTTTCCAATC 


CTCCXrTCTGCCTTGTCTCTC 


177 


(AT3>4^_(GA>4<C 
A)12 


D00109


TGTATGwTCAAGGATTATCTOG 


TCTCTGTGCCTGTGTCTCTGGC 


127 JfCA)17 


D00401 


TGCCCTCACCAGGTGTATAGA 


GTGTG AAT ATG ATCTGTCT AG AAA 


90 | (CA)22 


D00701 


CC1 OCA i Go AoCCTTCrn c 


rGTATGCTCATTAACCATAGTCTT 


150 


(CT)\y 


D00704 


ATGGGGGaaAGCTGAAGGAGATCC 


rGTCAGACTGATAATAATGC 


459 


fCA125 


DO 1004 


rCCCTGCATGGAG<rCTGCTT < 


3AACCCAGATTCCAGTTGCTA 


246 


rrC)12+<GT)J2 


DO 1204 


rATCCTACCTCTACACTCCTCCTG 


rGAGAGTTAAGGGGGTT AATGG 


589 i 


r GT)20 | 


D01205


AGCATGATGCCCTTCaAGGTC < 


3GATCTTTACCCGCATGTTCC 


201 ( 


'Crn2A2(GT)16 


D01208 


^CTCTGACAAGGTTCTGGCG ( 


3 AGTTT ATTTTGGTGGTGTC 


130 M 


f CA>12 


D01210


3CC AC AACT AC AC AAATAACTAA 1 TTCTACAGTGATGAATGCGAGT 


M3 ( 


CA»10 


D0I21I l( 


jCTTTTC TTCCC J 1 1 AGTGA 1 GTTCATAGCAGC AATGTCCAC 


127 i 


CA)23 


D0I212 1 CATAATAATTCCCACCACTACT 1 GGAGCCTGCTTCTCTCTCTG 


133 ( 


CAM 7 


DO 12 14 j ATCATTGTAAAOC AACCTCTC | TTCTCCCTCTCCCTCTGCCT ; 


254 ( 


CA)5(-GA>6 
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Table 2A (cont)




D03707 
PQ3708 



D03709 
D03803 



AGAGCCTAOATGCCCATCAA 
TTOAAAOAOATAAOGAGTCTOOAG 



ACATTTCTGAGTGGCATOGCT 
GTCAACAGCTTAGAAGTCACCA 



TTCACTTAOCGTAATATCCTCT 
TGCAGGTCCGACTCTAOAOOAT 



ACTCCC AAATCTTC AC AAAGG AA 
ACTATTATGCTGTATGGGTGCAA 



156 
82 



86 



fGT)20A/TO)10 



(OT)3AiCr\5 



90 



P038I3 
P03831 
O03X23 



D03908 



DO4I01 



D04402 



CT AAGATCAAATCCCACGTC 

CCACCCAGGCATCCCAAGA 

ATCTGGCTCCCTGCATOAAG 



G ATTCG ATCTG AGTTAGCAC 
ATCTCAGAGAOTTGGAATCAATC 
ACTTGTTTTCXCTCATATCTGTT 



(AC)12AAT(AC)3 
A/ACP 



TACACCTG ACACTT GTATCC 



CTGCATGGAGCCTGCTTCTC 



GTGCTTGTTAGTCCATOACC 



CCCAGGCACCCC C1 I I IVIC 
CTATTO ATTTTTCCAAAOC 



G AAT ATG ATGT ACC AOGTOTGG 



ATCAAGTCCCATGTCAGQCT 



94 

171 



179 



£}5£TG> 
AQ19 
(ClllOacS— Iff" 
A3VTA4YTA3Y9 



(AC)I3 



<TQ>I6 



D04403 



P04501 



ACTAG AAO ACAOCAAAATQ A 
G AACCTGTTTCTCCCTCTGCCT 



GTC7TTCATGTTTTCATATACTC 



AGQAATCTOCTTGGATCTCT 



130 



176 



ESDI* 



GTCTCTC 



I I GCCTCGT AG 



138 



GCAATCTATTAGTGGGGTCAT 



CTOACTCACAGCCTGAAATCTAT 



224 



(TG)14(GA)3GC(G I 

±2L 



TTGTC ATTO AGO AG AGTC AT 



CCACTOCAGAATGTATCTAAAC 



(CA)3TA/CA13 



TTGACTAAGGGACTCTCAG 



TGGGTGGCTCAGCAGTTTA 



254 



(OA)3{CA)10(GA) 
14 



CTGCTTCTGTCTCTQCCTAAT 



TCCCTCTGCCTGT G TTC1C1 Q 



280 



(CT)1Q— (CA)13 
(TO)10AG<T0)3 



D0461Q 
D04613 



AGCT ATCTCTTCATTTG ATCT ATCC 



ATCCAAAGACAATTCAAAOO 
ATCTCACTCAOAOCGAAAGCT 



CTACAAGGACAAGTGTGTCT ACTGC 1 223 



TTGGGTCTATTTCTGGGTTCT 
CG AGTTCCAAATCTT AC AGO 



133 
293 



(GT)l0fAT)7(AO6 I 



1XM6 1 4 j ATC^AGTCCCACATCGOOCT 
P04616 ( TCTC ATTCTTtrrrTATGOCfGT 



OTOGTTCTTAT VL I 1 1C1CI I ATC 



154 



fcT>i2rcmi2 



ATGCACCCTTATGTTTATTGCAG 



167 



D04617 
DO4702 
DO4710 
D04810 



AGGATGAGGTAGGAGTCAOAA 
G I (J I I O^AAGTCCTAAG AGC CTACC 
TCCCTGCATGQ AOOU J I C1 1 
CTCCTCTC 



GCTATOCTTTGOGATOACQTQ 



ATDCTOCT CT ACCCTCAQ AQQC 



271 



c attc attct aactto aototc 
atoaactctocacttoocgt"" 



112 



(GT)14 



«CA)12 



17 

rro>i4 



TC AAGTCC AC ATCAGGCTTC 
TCCCTGCATGOAACCTGCTTC" 



ACCTGGTGCTATCAA G T C 1 U I 



189 



(CA)19 



ACTOGGTTTAGTTGO ACT CC1 ' I A 



190 



(TO)U CA3fji2| 



TGGAGTCAGTAAAGCAOOCTA 



TQAGroAC TOTO TT CTATCTTGl 



122 



rrO)lOTAGtCCTO ] 
)41^lAfTO)7 



D04907 { TCG ATTG AGCCTCCC AAAT AACT 



CCATCACOCGC AGTC1 ti l AAT 



216 



(CA)13 



D04911 TGATAGACACTTGGGTTGCTTCCA 



ACTCTTGGGCATTTACTCCAAGGA 



164 



(AT)5(GT)9(AT)5Cj 
(AT)7 
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Table 2A (cont) 



D03005 



ACATCGOGCTCCCTGCAT 



ACCGTGCATGTCGC C AC A 



232 



(AC) 13 



D03008 



TCCCTATATGGAGCCTGCTTCT 



GAAGCTCCTATTTCXrCTTTCACCA 



IDO50I2 



G AAACTTC AT ACGCAG AC AAATG 



AAAGTACCTATGGTTGGAOCATA 



200 
136 



(CA)13 



(CA)I7 



I DOS 101 



AQGC ATCAGG AAAT ATT G TOGO A 



AQAAAACACACCCAGAOACAGO 



(D03120 



ACTCTOCTGTATAO AC A 1 C 1 " ! G I 
ACATCAGGCTCCCTACATOO 



AGC AO AGO A CT ATGGG AAAT AAC 



163 
108 



(GT)16<OAttl 



(TO)12 



DX-4 
IE004Q2 



CTCACTCAGOTTACTTOGCTOC 



170 



TCACCGTTTTAOCCAGTATTOC 



TCCATTCCTOATCGAGTTCTO 



212 



(ATTT>3(AC)3(AG I 

ILL 



[E00409 



Tocrn iggatgoagctoaao 



TG AG AGO ATC AG TTTCTGTTO 



(E03906 



AOCAGTOAAGTTTAATOAAATAC 



211 



GCTCAOO AATTACC AG AGO AO 



83 



(CT>Z(GT# 



(CT)12 



E03909 



AGCACTTACAGOGTGTCGTCGTA 



GACTTCCCAGTTGACTAAATAAGCTA 



214 



(CT>9 



(£03912 



TGTOGAGTCAGCTTCAOATTC 



GCTAAACCACTGCACCACTGG 



(£03913 



130 



AAACAAGTGOOOAGOGOAOO 



CTTO ATCG AQCCCT GC ATTGG 



(EQ39I4 



TCAGTCCCACATGCAGCTTCTG 



117 



(T018 



GTG AG ACCAAATTGTT ATTGT AA 



202 



(AG)4C(GT)7 



[E039I7 
IE03920 



AGOOAGAACAOATACTGACTCAA 



T AATC AGCCTCT AAGG ATTCTGG 



216 



(AO)H 



CTGTOTG AAOCCTC C ITCTC 



AOCCAGTCATGTGCCCTTA 



1 £03922 
E03923 



CACATTTTACATAAAAATAATATOCCA 



132 



CAGTGCATGO AGCCTG C VI CT C 



192 



(CT)9G<TC)3 



(AG)P 



CTOCATGOAGCCTG C 1 1 CI IC C 



GTTTCAGCATCTGCACCAOOAT 



172 



(CT)14GfTC\3 



[E04001 
(£04007 



TCAGC ATGG AATCT ACTTO AO 



OAATGTGAGTACAAAOOTAGO 



76 



GCTCATTGTOATTCCTTAAAACAO 



CTGOOGTCCGCOATGGAGT 



202 



(GA)3AfACM3 



E04008 



GGTAGCCTG C MCI C C C T CT G 



E040I9 



GCCCTCACTGO ACATCM 1 ATT 



ACCAGTOATTCCCTTCACCTG 



143 



} TGOAGCCTGCTTCTCCCTCTG 



116 



(CT)12(GT)$ 



(OA)13 



1 £04021 
I £04104 



CAG 1 riGGAGTCTG CnClC CCT 



j atcacctgaattgcagttgtca"" 



IS2 



(ctho 



actaggcatctcacatacattatt 



cctgcttctccctctgcctat 



109 



<AO>12 



{E04103 
1 £04107 



CCTGGAATGO AGC ACCATGTC 



atacttatgtccctgoctctg 



168 



(CT)8C2T2fCT)6 



ctccctctgcctatgtctctg 



ccaagcagttttaccacgata 



110 



(CT)12 



I £04108 
[ £04401" 



cttctccctctgccacttc 



ttctttatttgacaggoaaa 



98 



(CT)I0_...(CT>S 



cctggcatgoaocctgctt 



gtttttaggtct ac actt ctg agt 



122 



(CT)9CGTO 



(£04402 



I £04403 



TG AATC ATTATGGTCCT ATCGTTC 



IE04404 
IEO4407 



TGCATGO AGCCTG C 1 1C1C 



TAAAATGCAAGTCTTACCAGAGGAA 



OCCACATAQACACTTGOTCTT 



CCTTTCATTOAATATCTOTCAT 



111 



123 



CGGOATGOAGCCTGCTTCTC 



14 



GOAGCCTGC 1 J CI i CCTCTO 



CACTAGTAGCTTTATAA7TGTGCT 



124 



CCT)14G<TC>4 



(£04408 



(£04409 



TGCTTCTGG AAACTGCAC AT 



AGCCTGCTTCTCCCTCCrC 
i GAGATCGAATCCCACATCAC 



TGCATGO AGCCTGC 1 1 C I C 



GTTTTTAGTCTAC ACTTCTG AGT AA 



111 



(AG)12 



(C*Q9ffG)3 



JE04411 
IG00203 



CCTA CTCTTCC ACC ATTTTGCC 



166 



(CT)U 



IGO0402 



CTCTGCCTATGTCTCTOCCT 
GTTTG AACCCCTGCC ATAGGTA 



tgtatgtctatttttgtgccagta 



CGGAATCGAGTCCCACOTCA 



164 



173 



<TC)13 



(CA)3(GA^20 



(G00301 



TGOAGCCTGCTTCTCCCTCTG 



ATGCCCACGTCAGGTTCTCTG 



occaactctttacatctgtgcta 



148 



gttgttccagtattcattcattc 



171 



ISQ1L 



(CT)!! 



1 000304 



cctgctcagcagagagtcto 
agtgcgtggagcctocttct 



G ATTGG ATTATTTGT 1CI1 GO 



161 



(CPU 



1000308 
(G00312 



CATGTACTGOCCCATCATTCT 



196 



rcnu 



[G00602 



CAGOOCTCAATGAGTGATGTTA 



IG00603 
1 000703 



CGAGCTGCTCAACCGCTCAAC 
TTCTCCCTTTGCCTGTGTCT 



TCAAGTCTTGCATCOCACACC 



TGG AGCCTGCTT CTC CCTCTG 



138 



otctatcagagcaccagcttca 



187 



190 



<CA)13 



(OA)19 



(CD11 



cttctccctctgcctgtotct 



aagttctgtattgatttcattctc 



206 



(TC)6T3(TC)7CA< 
CT>3G(T03 



1 000704 



cotcctctgaatccctgtctat 



gtgoagcctgcttctctctttg 



223 



IGO0707 



citctccctctgcctatgtctctg 



gaaoocttagcaagagttgaaoa 



189 



(CT)9T(TC)3- 
17 



(CD13GACTATC 
ACTA3)2CTA3XT2 
A1 0XTA6)2 



CCTCTCCCTCTGCCTGTGTCT 



ACCTCTGAATCAGGAAATGTAACT 



132 



IG00709 
IgO0712 



ATCGAGTCCGACATAGOGTTCACT 



AAACAGTGTAAACAACATGCTACC 



132 ifCT)12fGTWCn3 



ATCGAGTCCCATOTTOOOCTCC 



G00713 



CTGG ATGG AGCCTGC 1 1 CIC 



C00801 



tgcttatgcgtactctctctcaa 



tgagcagoogcaataggagacttc 



226 



gcgtatctagtgatgccacttct 



194 



tccctocatogagcctgcttc 



XS4 



(CT)9G<TC)3ATG( ] 
A2T2XA3T)2(A4T) 
fC2T2A8) 



(CT)10T(TC)3ATO | 
(A2TXA3T)2(A4T) f 
fCT4A4) 



(CT)i2GAC<TCp 

ATG<A2TXA3T)2C 
A4TYCT3A9) 



G00SIQ 



ctccctctgcctacgtctctc 



agaagttactgtgtccaagtacaa 



132 



(CT)17(GTKACTA| 
TCAT(A3T)2(A4T 
3VAUT) 



SUBSTITUTE SHEET (RULE 26) 



WO 97/31011 



PCT/US97/02396 






Table 2A (cont) 



EHE3I 


i a rrr a rj".TnTP/TTr A A A ATriT 
| ACULAuU lui wA- 1 1 ^/wvft lul 


j GCT CTCCC 1 C 1 GC CTGTGTCT 


206 






| A T C3G AGwC J LA- J t L. I LA—w 1 w i 


TCArVTrATAAATTCAA.CTGGCTTA 


151 






1 GGCACGTTAGTCTAGTTCTCTG 


T AATC AGGTTCTTGG AO ATO AC 


139 


(GA>8AT(GA) 




f CCAAATTCAGGA 1 1 i C A G ACTCC 


a xnn a /"i*pv "i Pi/ "1 i / ■ T # 'OT*T4""T 




(OA) 12 




1 0C AGCC AAT ATG AC ATC ATOC 


TACATOOAGOCTGCTTCTCOC 


161 


(OA)S 




1 TGCATGOAGCXTGCTT CTCCC 


(JAAvAUC till UUAUUAwA-A, 


i tj 


(CDli 


IcNVMVfl 


1 TAGCTGTOAOCTOOGTOTOOA 


UOCAL I 1 UAL I I AA IL1 1 1 UAU 1 


1 1 A 


(CT)7 


GQ2813 


OGAGOATCCAATCCCACQ 1 C 


tp a i 'i 'i rrrrAn *i ATTAnTfy^AC: 

l L>A 1 1 lul LAU 1 1 A 1 I AU A vvAv 


A 


/•pT> ~>r~^r*rt"\ 

(w i >iL»C(L 1 p^(0 1 

>* 


G028I4 


"1 »l f^/ TT ATA i^T*A AAA AT^I 

TOCrrGCTTTA 1 Au 1 AAAAA 1 O 


1 lA, 1 (V>1 L-s-A_ A L 1 \~r\ — L- 1 A 1 


265 


i /p a% \<a*o a. \ » ^ 


002813 


TCL I Lrt— 1 UAA 1 A 1 UAvu 1 1 wA 


▲ AfiTifi A (VVVl aaaPP, AC AC AT 


154 


/PT\t € 


G02817 


a TV-»^ * a TP/V^ ATA TP A rWTTTP 
ATLUAA 1 La<UALA i 1— /\UVJw ■ w- 


V»^\L^/\^\ I ■ ■ a wju ■ r^. i »^ a a 


177 


AS- 1 


G02819 


a a r~Tr a py ATifiA flTfTrw " 1 " 1 f» 


C A CC AGO IT GO AAATG AAT AAG 


154 


fCTM 3 


O0282 1 


rrTfiT'APAfiAnrrTGCTTCTC 

w^W 1 UCALAUnUvW- 1 UV- 4 lUlv 


AAACCACTGAGCCACCCGGACT 


!53 


fCT^14 


C02902 


r; ATTf*. AHTCCTlACATCAGOCT 

KJ/K 1 1 VJ Aw A A W\WA» 1 


AGCTG TGTTT ATG ACTACACATG 


241 


fCT"2TGfCTV; / 

V w AAA W^W J ... ■ ,^ 

CT>6 




TAG AGCCTGC 1 1 C 1 CCCTCTG 


CCAATTTGAAGGATTCATCATT 


146 


(CT)2GT(CT)7GT 
ATCCTf8 


G03001 


TCCATCTGCCTATCACACCACT 


TG AGCACTGG ATGTT ATATGCAA 


199 


<TC)9 1 


L/vOLKJt) 


ATCT AATCCCAC ATTGGGCTC 


ATGGGG AGTC ATC AG ACC AGG 


171 


(TC)13 


fimrn i 

UUJUl 1 


TAGGCTTCCTGCTCAAGACAG 


GGATGGAGGAGAGGCTTGTTA 


209 


fGAT>S (TC>9 


UUJUJ 2. 


CTGCTCTC 1111 CGCTCACTC 


TTCTCCCTCTGC CTGTGTCT 


141 


(OA) 1 7 


GO30I3 


ACTGAGATGGGAAGOGGCAGA 


CTACATCGGGCTCTATGCTC 


83 


(OA)8 


UUJU 1 Q 


GAGCCTGCTTCTCCCTCTGC 


AGTCCTGTGATTAGTTCTCAGAC 


106 


{CT)10 


GO3017 


TCCTCCC AAC ATTCTAC AATO AA 




134 


fGABCA*TGA)9 


G030 J 8 


TGCTTCTC CCTCTGCCTGTGT 


CC11 CTOGATCI GCT 1 I JAClAl 


203 


(CT)13 


C03019 


CCACTCAGATGTCCCTATACTAT 


AAACAGGATCGAGTCCCACA 


212 


(GA)I3 


G03I04 


TAGCAGACAAACCCCAACTG 


GAGCCTOCTTCTCCCTCTG 


167 


(GA)13 


G03109 


CTGCATGG AGCCTGC I ICi 1 


TCTT ATTC AAATCCTCCTG ATT AT 


153 


(CT)9 


O031 ! 1 


CCI GCA1 GO AG ACT GCTTCT 


TGTTTCCTCACTTCTTACTOA 


218 


fCT)21 


UU36U1 


G AC ACC AGGTTG ATT ATC ATT 


TGGAGACCTGGGATTGAGTC 


166 


(GAMO 




ATCACACCCTGGGCTGAAGG 


TGGAGCCTGCTTCTCCCTCTG 


174 


(GA>14 




a/v: ATriTrPAnTTACAT'l IfiAA 

AvJVfA I UVAAvAVJ 1 1 r\ 111 \Jr\S\ 


TO ATfTTTTG ATGTTC ACGTTG AT 


208 • 


/p.AMf 


UvJUUi 


c a rrf.TriT a TrtTrrrm 1 attaa g 


wAUVAU 1 Ul AL- 1 A 1 IWVvl 1 vlU 


170 


^UAuU 


GO 5 60 2 


rArTAAArcArTfiAArAACCT 


VJ 1 vA.vAwU 1 l»AUVJw 1 V» 1 w | kJ 


1 JO 


fCt A >0 

(OAjy 


G05602 


papta a ArrAnrfiAArrArrT 




1 Jo 


/n a vo 


GO 5 6 04 


1 V^w/\ « vwvWU 1 VJV 1 1 W I w 


CCTCTTC ATACTTC A GC A A fTTG 


169 




wWUi 


CCCTTCTCTG TCTTTG AG AGT 

A * W A W A U • W A 1 J WAwflW t 


AGCXTTGCTTCTCCCTCTGCC 

/*V/V^ 4 WTw A A W 4 WWW- J Ws • VrvV 


144 


/ a a ^ w a a Yin/a 

\UAJ AVJ /7 U^U 

A15 


G06204 


CTTCTCCCTCTG A CTGTGTCT 


TCCCTC AAAATTC AAC AT AC AA 


168 


(CTU lfGAV3(CT)2 


GO6208 


CCTGCTTCTCCCTCCTCCTO 


TCCACAAAGCTCCCTACTCAT 


163 


(CTV10 


G0621 1 


CACTGGGCGTGTAACCTGCT 


CTGAAATGTAAGTGCAAAGOAA 


172 


(CT)12.-CA3C>8 


G06219 


CT AAT ATC AAAAGGTT ATCC AC 


CATC! ICC7C1GCCAGTGIC J 


267 


(GA)1 1 


GQ6221 


GGATAACCAGGATAATTTCCTAC 


AGAGAGGCCCACATCAGOCT 


156 


(AT4XAT3p(GA)l 
3 


G06222 


CTGCTTCTCCCT CTGCCTCT 


AATTTATGG AAATG TTCCC AA 


150 


fCDiTTrrcvj 


G06224 


GAGCCTGCTTCTCCCTCTGCC 


ACCCATGTATG AGCCC ATTG A 


!37 


(CT)19 


G06303 


CAGGTGCTGCAAGAGCTTAOA 


CI J CI CCC 1 i J CCCTCTGCC 


176 


(OA)17 


G06305 


GTCACGTCTTCAACCCTTCCT 


ATTGAGTCCCCCATCAGGCTT 


215 


fGA)14 


G06316 


AGCCTGC MCI CCCTCCTC 


CCACACCTCACACCCTGTA 


125 


(CT)I5 


G06320 


ACTOGCAATGGGTCTGAAAATAG 


CTCAGTTATTTGTGGGCTCTTT 


216 


(OA>13 


O06-40 1 


TGCrrCTCCTCTGTCTGTATCTC 


CAGH3TCXXXXTACACTAAGTG 


133 


(GA)IO 


006402 


ATGAATAGCTTGTGCATCAGTGCATT 


TGCTTCTCCCTCT GCCTGTGT 


132 


(OAM3 


G06407 


CCATCAAAC1 1 1 IACAGTOAA 


GOGTCTGC1 1 UrCCCICTCT 


163 


(OA)12 


G06407 


CAATCAAAC Jill ACAOTOAA 


GGGTCTGCJ IClCCXTrCTCT 


163 


(OA>12 


006502 


GTT AGOCTCTCTGTTC A OTOO 


COOTOATACC ITCC1 CATC AT 


146 


fCT)9 


006601 


TGTGG AAACTGCTTAC AA I 1 1 I C 


TOCCTTACCTTACAAAGTTATTG 


158 


fCT>17 


006602 


TCGAATCCCACGTCGGCCT 


ATGTT ACAATGATCTGATTTATTCT 


236 


fCT)5GT(TC)I2 


006603 


C ATTC AG ATGCGGG AGTTTC 


CCAOGTOAGGTCCAG I I G I O 


211 


fOA)9 


G06607 ( 


rrrCACAAGGTTGCACAAOAG 




139 


fOA>14 


G06608 


rGATAGGACACTT AGCAAAGOCT 


O AGCCTGC 1 ICICCCTCTOC 


194 


rCA)2fGA)12 


G06619 


4CAACCTACAGAATGGGAOAA 


crrrccACAGCCi 1 1 1 ait or 


196 


fCT^lb 


G06701 < 


JC \ 1 fl CACCCAACGACJ'I AG A 


AACTCTGTGGCTCAGCAAGC 


111 


f0A)12 


GO6703 < 


rTTCTCCCTCTGCCTGTGTC 


GCGTCTATAATCATCAGAAAT 


159 \ 


'CT>1UGT>4 


GO6705 < 


rrCTGCCrGTGTCTCTGCCTC 


CT AT AC AC ATTG AG AAATGGC A 


168 i 


TC>13 
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Table 2A (cont) 



H0350I 


TTGCCTT CT GGGTCTATTG ACTT 


1 GAATGTGG7TAGTAG AATTATACAG 


300 


(AT3)1CKAT2)2AT 


H03502 


GATCCTGATTGTTCTTGAG 


| GOCATGGAGCATACTTCA 


155 


<AT3>4 


H06601 


TGCTTCTCCCTCTGCCTGT 


TGGTGAAAGATTAGCCTGTGGA 


125 


(AT3)5 (AT4XA 

T3)2 


H06602 


AAGTCCC ACGTCAGGCTC 


ACGTCACCACAACCATCTAA 


l€5 


(AT3)12 


HO9205 


C ATTTGCT G AGTC AA GO AATTCT 


AGTT ACCTGO AA CTTGTC AG AA 


200 


<AT3)12 


HQS305 


TGCATGG AGCCTGC 1 id 


CTTCTACACATO 1 JU1COCT 


160 


(AT6XAT4>2(AT3) 
13 


H0920R 


AGTCC AGC ATC ACXX5TTTCT 


GAGGCTTA l n TCI GTCCAGTT 


144 


CAT3)9(AT4) 


H10101 


TC AGGCTC ATGGG ATTG AG ACTTC 


TGCC ATTGC AC AGC ATATAGGTCC A 1 


305 


(AT3)11 


H10103 


TCC AC ACTC AGTGC AG AATCTGCX I 


TGTGAGACCGCAGAATACAGTACTC 


141 
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Amplification reactions were carried out as described above, using the
touchdown PCR protocol (Don, R.H. et al., Nucleic Acids Res. 19:4008 (1991)).
For each locus, 5-10 dogs were studied in each breed. The numbers of alleles
observed are presented in Tables 3A and 3B.

Table 3A 
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Table 3B 



Marker   Doberman   Siberian   Scottish   English    Greyhound
Locus    Pinscher   Husky      Terrier    Pointer

D00101   3                                3          2
D00401   3                                5          5
D01205                         1          3          3
D01902              3          4          4          7
D02001              4          3          2          3
                    3          2          3          3
D02011              3          4          5          2
                    3          4          4          3
                    3          2          2          1
D03709   4          6          4          5
D03805   3          7          4          5          4
D03908   3          8          3          4          4
D04403   1          3          2          3          3
D04702   2          3          2          3          2



In general, all of the microsatellite loci tested displayed variability within and
across breeds. While 9 cells out of 140 (6.4%) in Tables 3A and 3B were
monomorphic, these were scattered through 6 different microsatellite loci, which were
quite polymorphic in other breeds. The maximum number of alleles detectable by
this analysis for a locus in a given breed was 8, in the case of locus D03908 in the
Siberian Husky. The percent heterozygosity observed at each locus in each breed
is presented in Tables 4A and 4B.
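
The monomorphic fraction quoted above can be checked with a few lines of Python. This is only an illustrative sketch: the counts 9 and 140 come from the text, the D04403 row is taken as read from Table 3B, and the function names are hypothetical helpers introduced here, not part of the original analysis.

```python
# Counts quoted in the text for Tables 3A and 3B.
MONOMORPHIC_CELLS = 9
TOTAL_CELLS = 140

# Allele counts for locus D04403 as read from Table 3B
# (Doberman Pinscher, Siberian Husky, Scottish Terrier,
#  English Pointer, Greyhound).
D04403_ALLELES = [1, 3, 2, 3, 3]


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


def count_monomorphic(allele_counts: list[int]) -> int:
    """Count cells in which only a single allele was observed."""
    return sum(1 for n in allele_counts if n == 1)


print(percent(MONOMORPHIC_CELLS, TOTAL_CELLS))   # 6.4
print(count_monomorphic(D04403_ALLELES))         # 1
```

A cell is treated as monomorphic here when exactly one allele was seen in the dogs sampled for that breed; with only 5-10 dogs per breed, such a cell does not prove the locus is fixed in the breed as a whole.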
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Table 4B 



Marker 
Locus 


Doberman
Pinscher


Siberian 
Husky 


Scottish 
Terrier 


English 
Pointer 


Greyhound


D00101


DU 


0 


78 


86 


38 


D00401




50 


86 


67 


100 




DU 


44 


0 


86 


25 




100 


63 ~ 


100 


100 


100 




a c\r\ 


57 


25 


50 


13 




n 
U 


50 


77 


71 


100 






33 


44 


43 


50 




U 


50 


17 


40 


0 




u 


U 


17 


17 


0 


D03709 


100 


78 
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frequency in the canine genome is accurate. These estimates have practical 
implications. In particular, since most cosmids have insert sizes in the 30-40 kb range,
the likelihood of finding a useful CA repeat in a cosmid clone harboring a gene of
interest is high.

SPECIFIC EXAMPLE II

Materials and Methods 

Patients and pedigrees. The patients and pedigrees used were primarily
those used and described earlier (Yuzbasiyan-Gurkan, V. et al., Genomics 15:86-90
(1993)). Briefly, pedigrees of American Kennel Club registered Bedlington terriers
were ascertained with the help of Bedlington terrier (BT) breeders. While all of the
pedigrees have a family history of CT, not all had a symptomatic proband at the time
of pedigree ascertainment. Diagnosis of dogs as to whether they were affected or
unaffected with CT was made in all cases by quantitative copper assay from liver
biopsies performed at 1 year of age or older by criteria earlier described. DNA was
extracted from peripheral blood samples collected in acid-citrate-dextrose as
anticoagulant as described (Yuzbasiyan-Gurkan, V. et al., Genomics 15:86-90
(1993)).

Microsatellite analysis. The microsatellite markers used in this study were
developed as described in Specific Example I. Standard conditions used to amplify
each marker locus in polymerase chain reactions (PCR) were as follows: 25-50 ng
of genomic DNA as template in 25 µl of PCR buffer (50 mM Tris HCl, pH 8.3 at
25°C, 50 mM KCl, 1.5 mM MgCl2), 200 µM dNTPs, 200 pM with respect to each
primer and 1.5 U of Taq DNA polymerase. A touchdown PCR protocol (Don, R.H.
et al., Nucleic Acids Res. 19:4008 (1991)) was established to facilitate the robust
amplification of most markers under the same conditions. PCR was carried out at
94°C for 45 sec, 52°C for 30 sec, and 72°C for 1 min.
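
As a rough aid to reading the cycling conditions above, the three-step profile can be written down as data and its hold time per cycle summed. This is a minimal sketch: the temperatures and durations are those stated in the text, while the cycle count of 30 is purely an assumed placeholder (the text does not give one), ramp times between steps are ignored, and the touchdown annealing schedule itself is not modeled. `Step`, `cycle_seconds` and `total_minutes` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int      # block temperature in degrees Celsius
    seconds: int     # hold time at that temperature


# Per-cycle profile as stated in the text.
PROFILE = (
    Step("denature", 94, 45),
    Step("anneal", 52, 30),
    Step("extend", 72, 60),
)


def cycle_seconds(profile) -> int:
    """Total hold time of one cycle, ignoring ramping between steps."""
    return sum(step.seconds for step in profile)


def total_minutes(profile, cycles: int) -> float:
    """Hold time for the whole run, in minutes, for a given cycle count."""
    return cycles * cycle_seconds(profile) / 60.0


ASSUMED_CYCLES = 30  # assumption for illustration only; not given in the text

print(cycle_seconds(PROFILE))                  # 135
print(total_minutes(PROFILE, ASSUMED_CYCLES))  # 67.5
```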

The microsatellite markers were initially evaluated in ten sets of parents from 
the BT pedigrees. Those markers for which at least one parent was heterozygous 
were then evaluated in all the dogs in the pedigree. Seven to twelve microliters of 
product were run on a 5% to 7% Hydrolink D600 acrylamide horizontal gel according 
to the manufacturer's instructions with the following modification. During the 
overnight runs, a plexiglas gel carrier was placed on top of the gel to prevent the 
swelling and distortion that was otherwise observed. Initially, electrophoresis was 
carried out from 4 to 5 hr. at 50 V in 1 X TBE (90 mM Tris, pH 8.3, 90 mM boric 
acid, 2 mM EDTA) with ethidium bromide. A photograph was taken and the gel
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The primer sequence and allele information about this marker are shown in 
Table 6. The allele frequencies were determined from alleles observed in apparently 
unrelated dogs. 

Table 6

Marker Locus                      C04107
Repeat Motif in Reference Clone   (CA)5CT(CA)n
Primer Pair                       TCAGCAACTATACATTTAAGAGGA
                                  CTGTCCCATCTAAAGGATAGG
Allele 1 and Frequency            163 bp, 0.39
Allele 2 and Frequency            167 bp, 0.61
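
The two allele frequencies in Table 6 can be used to illustrate how informative a two-allele marker is. The sketch below computes expected genotype frequencies and expected heterozygosity under Hardy-Weinberg proportions. That assumption is introduced here for illustration and is not made in the text; an inbred breed may well deviate from it. The function names are hypothetical.

```python
# Allele sizes (bp) and frequencies for C04107, from Table 6.
FREQS = {163: 0.39, 167: 0.61}


def genotype_frequencies(freqs: dict) -> dict:
    """Expected genotype frequencies assuming Hardy-Weinberg proportions.

    Homozygotes get p*p, heterozygotes get 2*p*q.
    """
    if abs(sum(freqs.values()) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    alleles = sorted(freqs)
    result = {}
    for i, a in enumerate(alleles):
        for b in alleles[i:]:
            product = freqs[a] * freqs[b]
            result[(a, b)] = product if a == b else 2.0 * product
    return result


def expected_heterozygosity(freqs: dict) -> float:
    """One minus the sum of squared allele frequencies."""
    return 1.0 - sum(f * f for f in freqs.values())


for genotype, freq in genotype_frequencies(FREQS).items():
    print(genotype, round(freq, 4))
print(round(expected_heterozygosity(FREQS), 4))
```

With frequencies 0.39 and 0.61 the expected heterozygosity is 2 x 0.39 x 0.61 = 0.4758, so under this assumption a little under half of randomly chosen dogs would be heterozygous at the locus.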



Marker C04107 was used to locate markers C04107B and C04107C shown 
in Table 2A, which are close to C04107 and also contain repeats. This "family" of 
markers may be used to detect CT. 

A typical pedigree illustrating linkage to C04107 is shown in Figure 3. In 
Figure 3, circles and squares depict females and males, respectively, and individuals 
affected with CT are indicated by the filled symbols. The asterisk in the figure
indicates an individual not available for analysis. The bands are the negative image
of amplification products obtained from the dogs indicated in the pedigree. The
affected individuals analyzed share the 2,2 genotype at this locus. In this pedigree, all dogs
with the 1,1 genotype are predicted to be homozygous normal while those with the 
1,2 genotype are predicted to be heterozygous, and thus carriers of the CT gene. 

Given the finding of linkage and allowing for a small error for recombination,
it is predicted that all the offspring with the 1,1 genotype are clear of the CT gene,
i.e., homozygous normal, and that all 1,2 offspring are carriers in this pedigree.
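
The genotype-to-status prediction described for this pedigree can be sketched as a simple lookup. This is an illustration only: it encodes the linkage phase of the pedigree in Figure 3, where allele 2 travels with the CT gene, and it ignores the small recombination error mentioned above. In another pedigree the phase would have to be established from an affected dog before any such table could be used. `STATUS_BY_GENOTYPE` and `predict_status` are hypothetical names.

```python
# Predicted CT status for each C04107 genotype in the Figure 3 pedigree.
# Assumption: allele 2 is in phase with the CT gene in this pedigree only.
STATUS_BY_GENOTYPE = {
    (1, 1): "homozygous normal",
    (1, 2): "carrier",
    (2, 2): "affected",
}


def predict_status(allele_a: int, allele_b: int) -> str:
    """Return the predicted CT status for an unordered pair of alleles."""
    genotype = tuple(sorted((allele_a, allele_b)))
    if genotype not in STATUS_BY_GENOTYPE:
        raise ValueError(f"unexpected genotype: {genotype}")
    return STATUS_BY_GENOTYPE[genotype]


print(predict_status(1, 1))  # homozygous normal
print(predict_status(2, 1))  # carrier
print(predict_status(2, 2))  # affected
```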

Since data on the ESD and RB1 loci were available for most of the dogs from
a previous study (Yuzbasiyan-Gurkan, V. et al., Genomics 15:86-90 (1993)), the
linkage relationships of these loci with C04107 were also evaluated. Neither
ESD nor RB1 was found to be closely linked to C04107 (see Table 5).

As demonstrated by the pedigree illustrated in Figure 3, given an informative 
mating, it is now possible to identify all the genotypes in the offspring, distinguishing 
between the homozygous normal, homozygous affected and heterozygous dogs 
provided the genotype of one affected dog is available. However, C04107 is not 
extremely polymorphic in the BT population, showing only two alleles and a 
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WE CLAIM: 

1. A primer comprising a polynucleotide, wherein the polynucleotide has
a sequence selected from the group consisting of the sequences of Table 2A.

2. The primer of Claim 1, wherein the sequence is the Sns sequence of
marker locus C04107 of Table 2A.

3. The primer of Claim 1, wherein the sequence is the Asn sequence of
marker locus C04107 of Table 2A.

4. The primer of Claim 1, wherein the sequence is the Sns sequence of
the marker locus C04107B of Table 2A.

5. The primer of Claim 1, wherein the sequence is the Asn sequence of
the marker locus C04107B of Table 2A.

6. A method for amplifying DNA, comprising the step of performing PCR
with the DNA and a primer set selected from the group consisting of the primer sets
of Table 2A.

7. The method of Claim 6, wherein the primer set is that shown as the
Sns sequence and Asn sequence of the marker locus C04107 of Table 2A.

8. The method of Claim 6, wherein the primer set is that shown as the
Sns sequence and Asn sequence of the marker locus C04107B of Table 2A.
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